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We address the task of switching off the Hamiltonian of a system by removing all internal and 
system-environment couplings. We propose dynamical decoupling schemes, that use only bounded- 
strength controls, for quantum many-body systems with local system Hamiltonians and local envi¬ 
ronmental couplings. To do so, we introduce the combinatorial concept of balanced-cycle orthogonal 
arrays (BOAs) and show how to construct them from classical error-correcting codes. The derived 
decoupling schemes may be useful as a primitive for more complex schemes, e.g., for Hamiltonian 
simulation. For the case of n qubits and a 2-local Hamiltonian, the length of the resulting decoupling 
scheme scales as 0{n\ogn), improving over the previously best-known schemes that scaled quadrat- 
ically with n. More generally, using balanced-cycle orthogonal arrays constructed from families of 
BCH codes, we show that bounded-strength decoupling for any Alocal Hamiltonian, where £ ^ 2, 
can be achieved using decoupling schemes of length at most 0{n^~^ logn). 

PACS numbers: 03.67.Lx, 03.65.Fd, 03.67.-a 


I. INTRODUCTION 

Consider a quantum system of n interacting d-dimensional qudits with a time-independent (possibly unknown) 
Hamiltonian H acting on a Hilbert space Tf, = We make the assumption that the system is Ulocal, i.e. that 

H can be written as the sum of operators, each of which acts only on £ of the n qudits. In nature it is usually the 
case that i is small even when n is large. Without loss of generality, we also take H to be traceless, and for technical 
reasons, we assume that d is a prime power (which includes the important case of qubits, i.e. d = 2). 

We consider the task of decoupling, i.e. effectively switching off the Hamiltonian H (including removing any couplings 
to the environment) so that the system effectively evolves under the zero Hamiltonian. Such a task is important, for 
example, in the context of quantum memory, where one desires to preserve the state of a quantum system. 

To achieve this task, we assume that the natural dynamics of the system can be modified by adjoining an open-loop 
(non-feedback) controller according to 


H ^ H + Hc{t). 

In practice, physical limitations restrict the types of control Hamiltonians available for use. We consider the realistic 
setting in which Hc{t) is only 1-local, i.e. due to our limited control of the system, He is the sum of operators that 
each act on only one qudit. We further impose the constraint that our control Hamiltonian Hc(t) is limited to be 
bounded-strength, i.e. a sufficiently smooth bounded function. This is in contrast to the setting of bang-bang control 
in which Hc{t) can be a discontinuous function that takes values of arbitrarily large norm. Our assumptions that 
the system Hamiltonian is an Alocal Hamiltonian acting on a system of n interacting qudits and that the control 
Hamiltonian is a 1-local bounded-strength Hamiltonian reflect the typical composite nature of quantum systems and 
their coupling locality as well as the limitations in implementing external controls. 

Viola and Knill proposed a general method for bounded-strength decoupling; see [T] and Chapter 4]. Their 
method, often referred to as Eulerian decoupling, relies on Eulerian cycles in Cayley graphs of a control group — 
a certain finite group of control unitaries that can be implemented by switching on control Hamiltonians, from a 
finite set of available control operations, for a fixed time. The Eulerian cycle dictates which control Hamiltonians are 
applied in the different time-slots of the decoupling protocol. 

The Eulerian method, as introduced in |T], does not make it possible to directly leverage the fact that the system 
Hamiltonian is Alocal in order to obtain more efficient decoupling schemes. However, in the setting of bang-bang 
control there do exist efficient decoupling schemes that are specifically designed for composite quantum systems with 
£-local system Hamiltonians; see mg and [21 Chapter 15]. In these schemes, the specification of which bang-bang 
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control unitaries are to be applied is chosen according to the entries of so-called orthogonal arrays of strength £. They 
are matrices with the property that any submatrix formed by an arbitrary collection of £ rows satisfies a certain 
balancedness condition. 

The work [5] presented a particular construction of decoupling schemes merging the approaches of Eulerian 
(bounded-strength) decoupling together with orthogonal array (bang-bang) decoupling. This construction yields 
schemes that require only bounded-strength controls and exploit the composite structure of the quantum system 
(namely, the locality of the system Hamiltonian) to achieve decoupling with fewer control operations. To do so, these 
schemes introduce the concept of so-called Eulerian orthogonal arrays. 

The purpose of the present paper is to further improve upon the method of |3] to obtain even more efficient bounded- 
strength decoupling schemes. To this end, we first generalize the Eulerian method due to [1] by showing that it is also 
possible to achieve decoupling with the help of so-called balanced cycles, which encompass Eulerian cycles as a special 
case. We then show that bounded-strength decoupling of composite quantum systems with local Hamiltonians can 
be accomplished based on the new concept of balanced-cycle orthogonal arrays. 

Note that all the schemes discussed above can also be applied to the situation of a general open quantum system 
with joint Hamiltonian H acting on a quantum system that is coupled to an uncontrollable environment. Such a 
Hamiltonian has the form 

H = Hs®lB + ts ®Hb + ^ 

cx. 

where the operators Hg and act on the system and where the operators Hb and act on the environment. We 
assume that the system Hamiltonian Hs and the operators Sa are all £-local. The decoupling goal in this case is to 
effectively switch off the system Hamiltonian Hs and remove all couplings to the environment. If, using controls that 
act only on the system, one can effectively switch off all generic system Hamiltonians, then such an operation will 
switch off Hs and each Sa, thereby accomplishing decoupling.^ For notational simplicity, the remainder of the paper 
will therefore ignore the environment and treat only the case of effectively switching off an arbitrary £-local operator 
H. 


II. DESCRIPTION OF THE CONTROL-THEORETIC MODEL 

Consider the group (F^,-!-), the additive group of the finite field of order q = df, where d (the dimension of the 
qudits) is some prime power. For the remainder of this paper, let p : ¥q ^ U{d) he a, faithful, irreducible, unitary, 
projective^ representation that maps the elements of ¥q to d x d unitary matrices, say p : g ^ Ug. That q cannot be 
smaller than d? for such a representation will be justified later in Remark that q = df suffices is justified by the 
explicit example shown below. 

We assume that for every 5 G we can implement Ug on any qudit of our system in the following sense: for every g, 
we can physically implement, over time 5 £ [0, A], a bounded-strength single-qudit Hamiltonian hg{6), corresponding 
to a single-qudit unitary evolution operator Ug{S), such that Ug = Ug{A) where A is some fixed length of time. We 
assume that we can do this on any qudit and, moreover, that we can do so for each of the n qudits in parallel. Note 
that this assumption obeys the practical control limitations discussed earlier. 

Of particular interest, in the case of qubits {d = 2, q = 4) we can consider the representation p : F 4 —>■ F/(2) 
that maps the four elements of F 4 to the four 2x2 Pauli matrices {1, X,Y, Z}. Thus, it is assumed that we can 
physically implement any Pauli operator on any qubit. Rather than assuming that q = 4, this paper will treat 
q more generally; however the reader is invited to think of the special case of qubits if desired. For non-qubits, 
with q > 4, we can generalize this example as follows. For a prime p, define X = \d ^ ™odp)(jj and 

Z = where a; is a p**' root of unity. For prime d = p, the map (a, 5) 1 — X°'Z’’ defines a faithful, 

irreducible, unitary, projective representation from J.d x to U{d). For a prime power d = p^ (for some e), map 
((ai, 61),..., (Oe, be)) 0 • • • (g) X^^Z’’^. 

A decoupling protocol is defined by specifying a sequence of control Hamiltonians (equivalently, control unitaries) 
to be applied. As shown in Fig. we construct an. nx N array with entries from F^, which we regard as a sequence 
of N columns from F^. The jth column gj = {gij,... ,gnj)'^ corresponds to the jth time interval [{j — 1)A, jA] of 
our protocol, during which we apply the control Hamiltonian 

(5) = hg,^ (5)010---0II 10---01 ((5) 


^ The remaining Hamiltonian term of Ig is inconsequential, as it does not affect the system at all. 

^ Projective representations need only be homomorphisms up to phase, i.e. obey Ug^f, UgUh with proportionality rather than equality. 
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FIG. 1: An nx N array, with each entry gij £ fq, shown within the dashed lines. Rows correspond to qudit numbers, columns 
to time slots (each of width A). This array encapsulates the control sequence, with Hc{t) = over 5 £ [0, A) during the 

interval t £ [(j — 1)A, jA). 


that gives rise to evolution ug- (5) = Ug^. (5) 0 • • • 0 Ug^. (S) over <5 £ [0, A] . In other words, for each 6 £ [0, A] and 
j = 1,... ,N, Hqit) = (i5) where t = {j — 1) A + 5. The total time required to apply the entire sequence, i.e. the 

control cycle length, is therefore Tq = NA, at which point the control sequence can be repeated. Observe that for 
any t = [j — 1)A + 5, the unitary evolution Uc{t) corresponding to the control Hamiltonian consequently satisfies 
Uq{t)=U^^{5)Uq{{j-l)A). ^ 

According to average Hamiltonian theory CHS], the resulting system evolution under H + Hc{t) can be effectively 
approximated by 

U{t) « 

at times t that are integer multiples of T^, i.e. t = mTc for any m S N, where 

^ ' Ucit)^ HUq{t)dt 

Jt=0 

is time-independent and where Uc{t) is the time evolution due to alone. The goal of decoupling, therefore, is 

to choose Uc(t) such that = 0 for any H. It is in this sense that we effectively switch off the Hamiltonian H. 
We refer the reader to m for a detailed description of the above control-theoretic model and the resulting effective 
time-evolution. We note, in particular, that although the approximation above is to leading order (in the Magnus 
expansion of Uc{tyHUc{t)), the second-order term may be eliminated by designing the control Hamiltonian to satisfy 
Uq(t)=Uq{Tq-t) |S|. 

The efficiency of the protocol developed in this paper is obtained by exploiting the composite structure of the 
Hamiltonian, namely the fact that H was assumed to be a local Hamiltonian. By definition, an f'-local Hamiltonian 
H on n qudits can be written as H = where each Hk acts non-trivially on at most i of the n qudits. In 

particular, the i = 2 case corresponds to Hamiltonians with only pairwise interactions. Our goal is to create a protocol 
that decouples each Hk simultaneously, and therefore decouples H. To see that this would work, observe that for any 
protocol Uc{t), 


Hio) 


^ J U,it)^HUq{t)dt = E ^ / 


Uq{t)^HkUq{t)dt^Y.^k^- 

k 


III. BALANCED CYCLES 

The success of the decoupling protocol introduced in this paper will rely on some basic group theory, which we 
introduce now. Let Q be an Abelian group with a generating set G G, i.e. any element of G can be written as a 
sum of elements from .5^. 

Definition 1 (Cayley graph). The Cayley graph, r(^, S^), of G with respect to .5^ is a directed graph whose vertices 
are labeled by the group elements and whose edges are labeled by the generators. More precisely, there is a directed 
edge labeled from vertex a £ G to vertex k £ G iff A. = .6 -I- a for the generator £ SG. 
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Definition 2 (Cycle). A cycZe, on r(Q,J^) is a traversal on F that starts and ends on the same vertex. We 
describe the cycle by the ordered list of elements from indicating the order in which the 

elements are visited, with the understanding that the cycle visits immediately after visiting All the 

cycles in this paper visit every vertex at least once, so we assume without loss of generality that the first vertex is the 
identity element, e, of Q. With this assumption we may equivalently represent the cycle '£g by specifying the edges 
traversed, i.e. = (. 61 ,... ,^n), where ^ = jij + ^ ^ for j = 1 ,..., A^; note that we differentiate between these 

representations by the subscript on but they both refer to the same cycle. 

Note that a cycle may visit vertices more than once and may traverse edges multiple times. We will be interested 
not only in the vertices, but also the specific labels leaving each vertex; we denote by the . 6 -labeled edge leaving 
vertex 

Definition 3 (Balanced cycle). We say that ^ is a balanced cycle if V.f. G > 0 such that G Q, occurs 

exactly times; in other words, the cycle is balanced if it is balanced with respect to each label G in the sense 
that it leaves each ^ via label ^ an equal number of times (independent of ^). Consequently, each ^ will appear in ^ 
precisely A = independent of Because a Cayley graph is a regular directed graph, it always has a 

balanced cycle whose length is then necessarily N = A|^|. 
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(b)Balanced cycle 

FIG. 2: (a) An Eulerian cycle on the Cayley graph r^Z 2 ,'|^o^, ^ balanced cycle in which each edge 

label leaves each vertex precisely once. Vertices correspond to the eight elements of Z|. Edge labels correspond to the three 
generators, namely ^ 0 ^ (purple), (green), and ^ 0 ^ (blue). The cycle starts at ^ 0 ^ and follows the path indicated (in 

ascending numerical order) by the circled integers (red), (b) A balanced cycle on the Cayley graph F^Z^, | | j . Vertices 

correspond to the four elements of Z 2 . Edge labels correspond to the two generators, namely (purple) and (blue). The 

cycle starts at and follows the path indicated (in ascending numerical order) by the circled integers (red). Observe that the 
cycle is indeed balanced: for each of the two edge labels, the edges leave each vertex the same number of times, irrespective of 
vertex. Specifically, the label leaves each vertex precisely ~ ^ times, while the label leaves each vertex precisely 

‘(!) 


An important special case of a balanced cycle is an Eulerian cycle on for which /r.* = 1 for every .6 G =5^. 

Examples of an Eulerian cycle and a non-Eulerian balanced cycle are shown in Fig. 2(a) and Fig. |2(b)| respectively. 
In [T] , Eulerian cycles were used to define decoupling protocols that avoided the discontinuous nature of bang-bang 
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decoupling. More generally, one can define decoupling protocols based on balanced cycles (of which Eulerian decoupling 
is a special case), to which we now turn our attention. Note, however, that this balanced-cycle decoupling protocol 
will not be the goal of this paper. Indeed, such a protocol will not exploit the composite structure of the Hamiltonian. 
Later we will utilize the balanced-cycle decoupling on ^-qudit subsystems of a larger n qudit space to develop more 
efficient protocols; in the current section, however, we may regard i as the size of the entire system. 

In exploiting the £-local nature of H, we will find that we are primarily interested in the group 

G = fl = {(oi,..., ae)'^ : a* G F,} 

with some generating set =5^ and the representation 

,g ^ U{S) 

defined from our representation p : Fg —)• U{d). Specifically, if ^ = (ai, • ■ • G G and p{ai) = Ua^ then = 

= Ua-^ ®- ■ ■®Uai- By our assumptions above, we can physically implement by applying the control unitary u^((5) 

(equivalently, the control Hamiltonian h^{S)) for time A. For example, in the case of qubits {q = 4), the group F 4 , 

whose elements we denote^ as {0,1, a, a -I-1}, is generated by the set S 4 = {1, a}. We choose y = {1^, a^,... 1^, a^}, 
which is a generating set of 2 £ elements for the group G = F 4 , where x’' here denotes the column (0,..., 0, a;, 0,..., 0)^ 
with a: G Fq in the ith position. In this case we assume p{ai) = Uai is a Pauli matrix, so p®^(^) is a tensor product of 
Pauli matrices. 

The purpose for the group theory used in this paper resides in the following observation [2 Chapter 4]. We define 
the operator Hg to act on matrices A as 


Be (A) 



( 1 ) 


Note that for every matrix A, Ilg{A) commutes with all (^ G G)- Thus, by Schur’s lemma, since p is irreducible^, 

we have Iig{A) = 1 (where D is the dimension of the Hilbert space). In particular then, if tr(H) = 0 then 

ne(H) = 0. 


Protocol 1 (Bounded-strength balanced-cycle decoupling). Let be a balanced cycle on r(5,o5^) of length 
N = \G\ with group element representation '£g = ... ,^^ and generator representation = 

(. 61 ,... ,^n)- For j = 1,... ,N, set Uc{0) = 11^ = t and 


17e((j - 1)A + s)= u^d) Uc({j - 1)a) , <5 G [0, A]. 


Note that because^ U^ Un = CA +n = Un , this implies Uc{jA) = Un (for j = 0,... ,N), i.e. 

^ Oj-i ■’ Oj-i dj dj 


- 1)A-F = u^^.(5) 17^ (5 g[0,A]. 


( 2 ) 


The control cycle length is thus Tc = NA = |^|AA. 

Theorem 1. The above balaneed-eycle protocol performs bounded-strength decoupling. 


^ Here the reader may prefer to equivalently think of the Abelian group as {l,z,x,y = xz = zx} with generating set 54 = {z,x}. Then 
we can use ^ = {z^^\x^^\ ... z^^fx^^'^} and p{x) = X^p{y) = Y, and p{z) = Z. Be aware, however, that the group operation used 
throughout the paper is denoted by + rather than by multiplication, since it is inherited from the finite field. 

Schur’s lemma guarantees this directly when i = 1. But then it also applies for i = 2 since then for any matrix A = Ti 0 Ci, we 
have Ug{A) = X]* Zlai,a 2 GFq 0 Ul^CiUa -2 TA trBitrCj = trj]. Bi®Ci = trA, and similarly for larger C.. 

^ up to phase, since p is a projective representation; since we will only ever conjugate by Uc, the overall phase is irrelevant and we shall 
simply ignore it. 
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Proof. Since ^ is a balanced cycle, occurs exactly times for every pair. Thus M.i,(i5)C/^ appears exactly /i^ 
times in the protocol for each .6,n, and so we have, for any traceless df x df Hamiltonian H, 


^ ^ Uc{t)^HUc{t)dt 

-^c Jt^O 


= MSyHuj,{S)dd 

= ne(F^(7?)) 

where Hg is defined in Eq. Q and Fy is defined by 

FAH) = y^^ r uAVHuA)dS. 

, Js=o 


Un 


(3) 


Recall that Hg suppresses traceless matrices. Assuming that H is traceless, and observing that is trace-preserving, 
we have that Ilg(^F^{F[)^ = 0. We conclude that = 0, i.e. the protocol succeeds at decoupling. □ 


Remark 1. For simplicity, we have assumed that p is irreducible. Then this protocol works for any traceless time- 
independent iJ, even if FI is unknown. It is possible to define protocols in which p is not irreducible, in which case Hg 
need not suppress all traceless matrices. However, in such a case, one must take special care to ensure that Hg still 
suppresses Fy>{F[) for the Hamiltonians of interest. See [TD] for examples in a similar context, as well as Example]^ 
later in this paper. 


Remark 2. Although Protocol[2performs bounded-strength decoupling, it would generally not be an efficient protocol 
were it applied to the entire system (i.e. if I were the number of qudits of the entire system). Assuming that p is 
irreducible, the representation : Q —>■ U{d^) necessitates that \Q\, and therefore Tc, are exponential in £. Indeed, 
suppose we have a representation from Q to U{D) such that for any D x D matrix A, Ilg{A) = Id as we used 


in Theorem 1 Consider sending the bipartite entangled state \tp) = \j) ® b)> or more precisely, = |'0)(V'|) 

through the channel 1 0 Hg (where I is the identity channel on a Z^-dimensional space) obtaining 


^ ®c/^) = (x® ne)(^') 


1 

—z Id® Id 


1 

IF 


• 


The matrix rank of the right-hand side is . Using the fact that rank(A-l-H) ^ rank(A) -f rank(i?) and that for each 
rank(|^(l C)[/^)'I'(1 (g)U^)) = rank(t[') = 1, the rank of the left-hand side is at most |C/|; thus, \Q\ ^ D^. Therefore, 

for the representation p®^ : Q — > U{d^) to succeed in the proof of Theorem [I we require that |0| ^ which is 
exponential in Incidentally, by considering the case oi i = 1, we have justified why we could not have chosen q less 
than d^ in our irreducible representation p : ¥q ^ U (d). 


Observe that the key to this protocol working is the fact that each 


shows up an equal number of times. 


independent of i.e. 'ih ^ 5^ 3ps > 0 such that S t/, occurs p.* times (independent of ^). In an Eulerian 
cycle, = 1 for every h, which is certainly sufficient. All else being equal, given the choice between Eulerian and 
other balanced cycles, we would choose Eulerian cycles as they will minimize N and therefore Tf.. However, we will 
see that when considering the composite properties of a system (specifically that interactions are local), we will be 
able to exploit the notion of balanced cycles to come up with a much more efficient protocol. 


IV. BALANCED-CYCLE ORTHOGONAL ARRAYS 

In Section and Fig. we indicated how we view our decoupling scheme as an array. For the protocol to be 
efficient, we shall ensure that this array corresponds to what we call a balanced-cycle orthogonal array (BOA). A 
BOA is a special type of orthogonal array (OA), which we first define. We refer the reader to [TT] for a thorough 
introduction to OAs, particularly their relationship to linear codes (of which we shall later make use). 
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FIG. 3: Example of a 0 ^ 2 ( 8 , 7, 2, 2), i.e. a OA\{N,n,q,£) with N = 8 columns and n = 7 rows on the finite field ¥q = Z 2 
of order q = 2. Any subarray defined by any 1 = 2 rows contains each 2-tuple precisely A = 2 times. For example, rows 5 and 
7 (highlighted) form a 2 x 8 subarray in which and each occur precisely twice. Note that typically in this 

paper, q = dF (for example, for qubits g = 4), but for simplicity, the example in this figure uses q = 2. 


For notational consistency, we point out that throughout the remainder of this paper we adopt the notation that 
Q and 5^ refer specifically to the group and a generating set for F^, respectively. Elements of Q will be denoted 
using script elements of will be denoted using script 21 , and cycles on Q will be denoted When other groups 
(such as Fg or F^) are being considered, other notation (such as g, m, S, s and C) will be used instead. 

Definition 4 (Orthogonal array). An OA\{N, n, q, i) orthogonal array on the alphabet F^ is an n x array where 
each of the N columns is a vector from F” such that every £ x N subarray (obtained by only considering a selection 
of just £ of the n rows) contains each possible ^-tuple of elements of F^ (i.e. contains each c G F^) precisely A times 
as a column. The number £ is called the strength of the OA. 

Remark 3. To relate these numbers to those appearing elsewhere in this paper, 

• N will correspond to number of steps in the decoupling protocol (i.e. the length of our balanced cycle), 

• n will correspond to the number of d-dimensional qudits describing the system, 

• q = (P (e.g. for qubits, d = 2 and <7 = 4), 

• £ is the locality of the Hamiltonian (e.g. for pairwise interactions, £ = 2), and 

• A = N/q^ will be the same A as in our discussion of balanced cycles, A = 

Remark 4. Note that the order of the columns in the OA is irrelevant to whether the array is an OA. Moreover, if 
A = [di] is an OA\{N, n, q, £) with columns di,..., Um then the matrix A', whose columns consist of precisely r copies 
of each di (in any order), is an OAr\{rN,n,q,£). Note, however, that while the order of the columns does not affect 
the OA property of the array, when defining balanced-cycle orthogonal arrays (which we do next), we will be highly 
concerned with the order of the columns in the array. 

An example of an 0 ^ 2 ( 8 , 7, 2, 2) is shown in Fig. Orthogonal arrays have been used to construct bang-bang 
decoupling schemes (see [3H5] and [3 Chapter 15]). In order to construct a bounded-strength scheme, we introduce 
the notion of a balanced-cycle orthogonal array, defined as follows. 

Definition 5 (Balanced-cycle orthogonal array). A BOA{N,n,q,£) balanced-cycle orthogonal array on the 
alphabet F^ is an n x iV array, A, where each of the N columns is a vector from F^ such that every £ x N subarray 
(obtained by only considering a selection of just £ of the n rows) defines a balanced cycle on the Cayley graph of 
G = with respect to some generating set for G (which may depend on the subarray). Specifically, if the entries of A 
are denoted Oij (with 1 ^ i ^ n and 0 ^ j ^ N — 1), then for every choice of £ distinct integers ii,... Ae G {1, • ■ ■, n-}, 
there is a generating set for G (which may, in general, depend on ii,... ,ie) such that if ^ = (oi^j,..., denotes 

the jth column of A restricted to rows ii,..., ii, then '£q = ... ,^^ defines a balanced cycle on r(^, 

An example of a BOA is shown in Fig. We defer the proof that BOAs exist to Sec. jV] The remainder of the 
current section defines a decoupling protocol based on BOAs and proves that it works to decouple .^-local Hamiltonians 
in n qudit systems (£ ^ n). Working with £ qudits (rather than n qudits), along with the promise that H is Olocal, 
will enable us to give an efficient protocol. 
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FIG. 4: Example of a BOA(24, 7, 2, 2), i.e. a BOA{N,n,q,£) with N = 24 columns and n = 7 rows on the finite field F„ = Z 2 
of order q = 2. Any subarray defined by any £ = 2 rows defines a balanced cycle on the Cayley graph r(5, y) of Q — = Z 2 

with respect to some generating set (which may depend o n the subarray). For example, rows 5 and 7 (highlighted) form a 
2 X 24 subarray that defines the balanced cycle shown in Fig. |2(b) The circled integers (red) correspond to the steps taken by 
the balanced cycle as shown in that figure. Note that typically in this paper, q = d? (for example, for qubits q — 4), but for 
simplicity, the BOA example shown here uses q = 2. The method by which this BOA was constructed is detailed in Example [T] 
of Sec. 


Protocol 2 (Efficient, bounded-strength balanced-cycle decoupling based on BOAs). Let A = 

[aj]j=o....,Af-i be a BOA{N,n,q,£) whose columns are denoted by the vectors oy = (oij,... ,anj)’^, where atj G F, is 
the (i,j) entry of A. For j = 1, ..., iV, let 6^ = aj — aj-i be the transitions between the columns, treating = Sq = 0. 
For j = 1,... ,N, set Uc{0) = 1 and 


C/,((j-l)A + <5) =uj^(<5)C/c((j-1)a), [0,A]; 


note that this implies that Uc{jA) = Ug- (for j = 0,..., N). The control cycle length is thus Tc = NA. 

Theorem 2. The above protoeol performs bounded-strength decoupling. 

Proof. H is an £-local Hamiltonian, H — with each Hk acting non-trivially on at most I qudits. Consider a 

term Hk, which acts non-trivially only on qudits denoted ..., and write Hk = hk® ln-^> where hk is understood 
to be a X d^ matrix acting only on these i qudits and In-e is the identity matrix on the other n — I qudits. By 
definition of a BOA, the ix N subarray of A restricted to rows ..., defines a balanced cycle on r(^, where 
is some generating set of ^ = F^. The idea of the proof is to observe that the protocol involving the columns dj 
for decoupling Hk is equivalent to a protocol involving the subarray’s columns for decoupling hk', since the subarray 
defines a balanced cycle, we can then invoke Protocolto successfully decouple hk and therefore Hk. 


Let 


= (' 


'iij ? • • • ? ^1^3 


j)’^ denote the jth column of A restricted to rows ii,...,ie and let ^ ^ ^ = 


(bi,^j,... ,bi^j)"’", where bij is the ith entry of bj. Then the cycle ^ is represented as '£g = and 

= (. 61 , . . . ,.t,Ar). 

As in the proof of Theorem^ we are interested in Uc{t)^HkUcft). The control unitary at time t = {j — 1)A -f d is 


- 1)A -b (5^ = u^^{S) Uc{{j - 1)A^ = Mg^(d) = {ub^.{5) ®---® Ub^.(S)) (Ua, 


0 - 1 ) 


•Ua 


•(i-i)) 


Thus, when conjugating Hk = hk® In-e by Uc{{j — 1)A -b d), all of the unitaries not acting on the £-qudit subspace 
of hk will commute through Hk and cancel, leaving only those corresponding to the £-qudit subspace, i.e. those 
corresponding to the labels j,j and ^ . Explicitly, 

l)A-b(5^ HkUc{{j-l)A + 5^ = «)•••«) /ife 

{ub,^,{5)®---®Ub^^^{5)^ ®tn-l 

= ul Ujs- {5)^ hkUjsM)Un 

iJj-i < 13-1 

Thus, the protocol of applying Uc to Hk is effectively the same as applying a protocol Uj,.(6)Un to hk, following 
the balanced cycle Since this is precisely the scheme defined in Protocolapplied to hk (see Eq. (|^), we conclude 
















from Theorem that it decouples hk- Consequently, 0 In-e = 0. This occurs for every term Hk in 

H = J2k whence H itself is decoupled: = J2k ^'k^ — 

□ 

Remark 5. Once we have a BOA scheme that can deconple a system of n qudits, the same scheme can be nsed (with 
the same BOA and therefore same length N) for a system oi n' < n qudits. This can be accomplished by simply 
ignoring n — n' oi the qudits, i.e. by having Uc act as 1 on these n — n' extra qudits (rather than as dictated by the 
original protocol). The proof of Theoremremains nnaffected becanse Hk acts trivially on these extra qudits, i.e. 
they are not acted upon by hk- 

Theorem showed that decoupling protocols based on BOAs work, with control cycle length proportional to the 
BOA parameter N. We next show that BOAs can indeed be constructed and, moreover, that the construction gives 
rise to an ejjicient decoupling protocol, in the sense that N does not increase exponentially with n. 


V. CONSTRUCTION OF BALANCED-CYCLE ORTHOGONAL ARRAYS 

The existence of balanced-cycle orthogonal arrays follows natnrally from constrnctions of orthogonal arrays gener¬ 
ated nsing classical linear codes, which we shall define shortly. We first give a brief outline of onr BOA construction. 
This constrnction is via the generator matrix G of a linear code, which is a linear mapping from to F^ for some 
k ^ n. If we ennmerate all elements of F^ in an arbitrary order and consider their image under G, this will form 
an OA of strength £ (for an appropriately chosen k). To obtain a BOA, we do this enumeration according to the 
prescription of an Eulerian cycle on F^. In doing so, we can guarantee that we always obtain a balanced cycle when 
we consider any snbmatrix of i rows, nltimately ensuring that any Glocal Hamiltonian term on those corresponding 
qndits will be deconpled. We now prove this, starting with a definition of a classical linear code. 

Definition 6 (Classical linear code). A classical linear [n,k]q code, G, is a /c-dimensional snbspace of the vector 
space fg. For any vector x = {xi,.. ., Xn)'^ & F^, define wt(a;) = |{i G {1,..., n} : Xi yf 0}|. The distance of a linear 
code G is defined to be min{wt(c) : c G G, c yf o}, where o denotes the zero vector. An [n,k]q linear code can be 

described by a generator matrix G of size n x k with entries from F^. G maps the vectors m G onto the elements 

{code words) of G so that G = G[F^] = {Gm G fg : m G FJ}- 

The dual code G"*- of G is defined by G-*- = {y G fg : x ■ y = 0 yx G C} with the dot product x ■ y = 

The dual code is also a classical linear code, namely an [n,n — k]g code with some distance that we will refer to 
as the dual distance. Orthogonal arrays can be constructed from linear codes, as the following theorem m Theorem 
4.6] establishes. 

Theorem 3 (OAs from linear codes). Let C be a linear [n,k]g code with dual distance The n x matrix 
[Gruj^gipfe, whose columns are the vectors Gm G F” (Vm G fg), is an OA{q^,n,q,£) with strength £ = 6^ — 1. 

Let G be an [n, k]g with dual distance J-*- = £ + 1 and generating matrix G. Let C be an Enlerian cycle on the Cayley 

graph r(F^,4'''^), where 5^^ is a generating set for F*; thns, we can write Cfk = (mo,... ,mjv_i) with transitions 
Ga(k) = (si,... ,sn) and N = q'^ Becanse c? is a prime power, say d = for some prime p, the minimal 

Oq 

generating set is of size = 2ke. We are interested in the image of the cycle in the codespace; thus, consider 

the Eulerian cycle, denoted G£, on r(G[Fq], GJS'q*^]), where G[Fg] = G C F^ is the image of F^ under G, i.e. is the 
codespace. In other words, GL^k = (Gmg,..., Gm^v-i) and GC„{k) = (Gsi,..., Gs^r). 

To avoid possible confnsion, we emphasize here that although we will use GC to construct a BOA, neither C nor 

GC will serve as the balanced cycle to which Theorem applies (which is why we have nsed the notation C rather 

than )6). Rather, for our efficient decoupling scheme, we constrnct an array Aqc, dictated by GC, and prove that 
the resnlt is a BOA by showing that if we consider any subarray of £ rows, it gives rise to some balanced cycle ^ on 
Q = fg. The notation and relationships of the various groups and cycles nsed in this paper is sketched in Fig. 5 

We turn GC into an array Aqc the obvions way as follows. Each element Gmj of GCfk is a column vecror in 
F”. Therefore we may associate to GC the nx N matrix Aqc = [Gm]m^c^k with elements a^- = {Gmj)i, so that the 

jth column of Agc is the vector Gmj, and the columns are arranged in the order of the Enlerian cycle GC. Note that 
since we assnmed that Enlerian cycles always start with the (additive) identity element, i.e. the zero vector o G F^, 
and since G maps the zero vector to the zero vector (Go = o G F”), the first colnmn of Aqc is the zero vector of F^. 
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FIG. 5: Names and relationships between various groups and cycles in the BOA construction. The graph is a schematic of a 
Cayley graph. As explained in the text, the BOA, Aqc, is the array form of GC, which is the result of mapping an Eulerian 
cycle C under the linear code generating matrix G. As a BOA, Aqc has the property that if one considers a subarray of £ rows, 
the result describes a balanced cycle. Specifically, let I C {1, 2,..., n} be a subset of £ indices. {Gm)i denotes the Gtuple of 
elements of Gm (itself an n-tuple) corresponding to the indices I. The cycle composed of nodes {Gm)i (in the same order 
in which C was composed of m), is shown to be a balanced cycle. 


Lemma 1. Aqc is an OAff/g£{N,n,q,i) with N = 

Proof. By Theorem an array whose q^ columns are the vectors of the codespace is an OA. The columns of Aqc are 
precisely copies of each vector in the codespace, and therefore (using Remark |^, Aqc is an OA. □ 

Let 3 G Sq^\ Gs G so Gs = ^(Gs)i,..., (Gs)„^ , where we use the notation (Gs)i G F,j to denote the zth 
component of the column vector Gs. Fix £ distinct numbers ii,... ,ie G {1 ,... ,n} and write I = {fi,..., ii\. Let 
(Gs)/ denote the Gtuple ^(Gs)ij,..., {Gs)i^ . Let = |(Gs)/ : s G 

Lemma 2. is a generating set for ^ = F^. 

Proof. Let ^ & G- By definition, since Aqc = \Gm]m&c^k OA of strength £, the £x N subarray obtained by only 
considering rows contains each possible Gtuple of elements of F^, and therefore contains Thus, 3 Gm such 

that {Gm)i = Since 5'^^ is a generating set for F^, 3 ui,...,Ur G such that m = ui A ■ ■ ■ + Ur, and therefore 
Gm = Gui + • • • + Gur. But then {Guj)i G for every j = 1,..., r and ^ = (Gm)i = (Gui)/ + • • • + (Gur)i, whence 
generates Q. □ 

Recall Aqc = [Gm]m^c k consider the £xN submatrix Ag — [n ] of Aqc, whose jth column is a. = {Gmj)j G G. 

Fq Oj Oj 

Define the ordered list '£g = Although ^ depends on I, we suppress mention of this for notational 

simplicity. 

Lemma 3. ^ is a balanced cycle on T{Q,S^). 

Proof. Each ^ G ^ is present in '£g an equal number of times because Aqc is an OA of strength £. The transitions in 
this cycle are j,j = ^ ^ ^ = {Gmj)j — (Gmj-i)i = [Gsj)i G dA', so the transition representation = (/>!,... ,/>//) 

consists of generators from ,5^; ^ is therefore a cycle on the Cayley graph r(^, d^). Moreover, because C is an Eulerian 
cycle and Aqc is an OA, ^ is a balanced cycle (although not an Eulerian cycle): informally, each Gm» ^ - > occurs in 
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GC an equal (non-zero) number of times (namely once, independent of Gm) for each Gs, so each occurs in an 
equal (non-zero) number of times (independent of ^ = {Gm)i, since Aqc is an OA) for each ^ = {Gs)i. 

Explicitly, consider any ^ G ^ and jt> G ■ Let = {m G : {Gm)i = ^}. C<^k is an Eulerian cycle so each element 

in F^' shows up precisely |S'q^^| times in C^k. In particular, therefore, each m G appears precisely times in 

Cfk, and consequently, ^ shows up in ^ precisely times. But Aqc is an OA of strength £, so must 

then be independent of and therefore \M^\ is also independent of Since i, G o5^, let Sj^ = {s G : (Gs)i = .6}. 
This set is non-empty by definition of . In general, |S)i,| may depend on Now, Vm G and Vs G Sj^, the Eulerian 

property of C guarantees that Gm» ^ - ^> occurs precisely once in G£. Therefore, occurs in '£ precisely |S'^||M^| ^ 1 
times, which is independent of Thus is a balanced cycle. □ 

Together, the above lemmas prove the existence of BOAs and how to construct them from classical linear codes. 

Theorem 4. Let G, C, and Aqc he as above, i.e. G is an [n, k]q code with dual distance 5'^ = I ^ and generating 
matrix G, C is an Eulerian cycle on the Cayley graph r(Fg,S'g*^), written L^k = (mo,... ,mN-i) and Cg(k) = 
(si,. .., Sat), and Aqc = [Gm]mec,,rk whose columns are the vectors Gmj. Then Aqc is a BOA(N,n,q,i) 

with N = q'^\S^q'"^\. 


Proof. For every choice of £ distinct integers I = {ii,...,i(f} C {!,...,n}, the set = |(Gs)/ : s G is a 

generating set for Q (by Lemma |2 1 such that if ^ denotes the jth column of Aqq restricted to rows ii,...,ii, then 

(by Lemma 1^ V ^ balanced cycle on r(^,c5^). □ 

For n interacting qudits of dimension d = p^ (for some prime p and positive integer e) that obey an Glocal 
Hamiltonian, this construction therefore allows 


N = q'^\sf^=q^2ke (4) 

where k is the dimension of the code used and q — d^. Observe that the BOA decoupling protocol (Protocol]^ for 
this BOA construction has a control cycle length of Tc = NA = d^^2keA where A is some fixed length of time. For 
example, in the qubit (d = 2) case discussed above, |S'4^^| = 2k, whence Tc = NA with N = (2fc)4^. To maximize 
efficiency for a given n and £, one should select a code that minimizes k (equivalently, select a dual code that maximizes 
k^ = n — k). 

There exist many good families of classical linear codes. For instance, for 2-local interactions, we can (as was done 
in [12] for OAs) rely on [n,k]g Hamming codes with dual distance 3 such that k = log^ ((g — l)n -I- l); our scheme 
then has N scaling like nlog(n). This protocol is therefore much more efficient than a naive protocol of applying 
balanced-cycle decoupling (including Eulerian decoupling) without exploiting the Glocal structure of the Hamiltonian, 
which would have a control cycle length that scales exponentially with n. It is also more efficient than the method of 
|5|, which required N = , i.e. whose scaling for this case (£ = 2, using Hamming codes) is quadratic in n. Next, 

we address codes for BOA construction with values of £ greater than 2. 


VI. BOA DECOUPLING SCHEMES FROM BCH CODES 

In this section we show how to construct schemes that achieve decoupling for Glocal Hamiltonians on TL = (C'^)®" 
for arbitrary £, n, and prime power d. Besides the machinery of balanced-cycle orthogonal arrays (BOAs) that was 
introduced in the previous sections, our construction relies on BCH codes as a particular vehicle to construct good 
BOAs. The choice of BCH codes results from the fact that they are among the best known codes for the particular 
situation where the distance is a fixed, small number and the goal is to maximize the overall code dimension. Using 
the dual of a BCH code when constructing the corresponding orthogonal arrays, we obtain schemes with a designed 
OA strength (i.e. locality £) while having a small N in the corresponding decoupling protocol. We begin by briefly 
recalling some basics about BCH codes; for more details on finite fields and BCH codes see, for example, the textbooks 

HMD- 
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Definition 7 (BCH code). Let a be a primitive n-th root of unity in the finite field where q is a prime power, 
n ^ 2, and m ^ 1. A BCH code over of length n and designed distance D, where 2 ^ ^ n, is a cyclic polynomial 
code defined by the zeros 




h+D-2 


where 6 ^ 1 is a positive integer. 

The generator polynomial g(x) of the cyclic code introduced in Definition is given by g{x) = 
lcm(Mf,(a;), Mb+i(a:),..., M;i_|_£i_ 2 (x)), where Mi{x) denotes the minimal polynomial of a* over F,. Note that even 
though the zeros of the code lie in an extension field F^m over F^, the BCH code itself is a cyclic code over the ground 
field Fq. Furthermore, it is known that a BCH code defined this way has a distance 6 that is at least D, which is 
why D is sometimes called the “designed distance.” Note that the actual distance <5 of the code might exceed D. The 
possible lengths of BCH codes are quite restricted, as any admissible length n must be a divisor of the order of the 
multiplicative group of fgm, i.e. must be a divisor of q™ — 1. In the following we restrict ourselves to the case where 
n = q"* — 1, which is called the case of primitive BCH codes. Furthermore we only consider the case where 6=1, 
which is called the case of narrow-sense BCH codes. We denote these codes by BCH(Fgm/Fg, D), and we note that 
they always exist. 

For any linear error-correcting code C = [n,k,d]g of length n, dimension k, and distance 6 , an extension C = 
\n-\- 1, fc, 5' ^ (5]q can be defined by adding another coordinate and an overall parity check. At the level of parity check 
matrices, this corresponds to appending the parity check matrix M ol C with an all-zeros column 0 and an all-ones 

. For binary codes, the distance of the extension is easy 

to characterize: if 6 = 0 mod 2 then S' = 5 and if 6 = 1 mod 2 then 6' = 6 -f 1. In general over larger alphabets, 
however, it is possible that the distance increases even when S is even. When applying an extension to the BCH codes 
introduced above, we use the notation BCH®’^*(Fqm/F^, D). We make use of the following theorem about such codes. 


row so that C has the new parity check matrix 


Theorem 5. Let F^ be a finite field and let BCH®’'*(Fgm/Fg, D) = [n,k,S]q be the extension of the primitive narrow- 
sense BCH code with designed distance D constructed in Definition^ so n = q™ and S ^ D. Assume that D ^ 
g\m/ 2 \ _|_ 2 Then the dimension k of the code satisfies k ^ n — m \— 2)] — 1 ^ n — m{D — 2) — 1. 

See [T5] for a proof of Theorem that leverages the fact that the extended primitive narrow-sense BCH codes are 
subfield subcodes of the Reed-Solomon codes. See also dSl Problem 8.12] and [IT]. By combining Theorem with 
the construction of Theorem we now obtain the following result regarding bounded-strength decoupling for Alocal 
Hamiltonians. 


Theorem 6 . For any £ 2, n ^ — 1)^, and q = df with d ^ 2 a prime power, there exists a BOA{N, n, q, £) whose 

length N scales as N = 0{n^~^ \ogn). That is, there exists a bounded-strength BOA decoupling scheme to switch off 
£-local Hamiltonians on n interacting d-dimensional qudits that uses N = 0{n^~^ logn) time slices. 

Proof. First, note that if n is not of the special form n = q"* where m ^ 1, then we can always embed the n qudits into 
a larger system of q™ qudits with m = [log^ (n)], construct a scheme for the larger system, and ignore the additional 
qudits (as per Remark [5). This increases n by a factor of at most q and therefore doesn’t affect the statement of the 
theorem, i.e. we can without loss of generality assume that n = q™ where m ^ 1 . 

Now, we consider the code C that is the dual of a /c-*--dimensional BCH®’‘*(Fqm /fq,D) code with designed distance 
D = £-\-l. Thus C has length n, dual distance J-*- D = £-\-l, and, according to Theoremj^ dimension k = n — k^ ^ 
m{D — 2) -I- 1 = m{£ — 1) -I- 1. By Theorem]^ this means that we can construct an n x Nqa orthogonal array of 
strength 5^ — \^£ from this code, where Nqa = 9^ ^ According to Theorem|^ the corresponding 

BOA has an overhead that scales at most logarithmically in n since from Eq. Q we obtain the following bound on 

the length of the bounded-strength decoupling scheme corresponding to the BOA: Nboa = = q^2ke ^ 

[qn^“^][2(m(£ — 1) -f l)e] = 2qen^~^[{£ — 1) log^ n -|- 1] = 0{n^~^ logn). This establishes the claimed bound. □ 

In physical systems, the locality £ is generally a small fixed number, so the requirement oi {£ — 1)^ is inconse¬ 
quential asymptotically, while for small n, one can (by Remark]^ always artificially increase n to satisfy it. Our main 
focus in Theoremis on the asymptotic cost for fixed locality £ as the number n of qudits grows. It should be noted 
that, depending on the particular choice of q, £, and n, further improvements over the bound in Theorem]^ are possi¬ 
ble; see e.g., HZlllH]. This in turn leads to further improvements in the length of the decoupling schemes constructed 
via Theorem]^ For instance, for 2-local qubit Hamiltonians we saw at the end of Sec. jVjthat Hamming codes can 
be used to construct BOA decoupling schemes of length N = 2[3n -I- 1] log 4 [ 3 n -f 1], giving a slight improvement over 
schemes constructed from primitive BCH codes which lead to a scaling of ^ 8 n[log 4 (n) -|- 1]. 


12 





£\N 

64 

384 

2 048 

10 240 

49 152 

229 376 

1048 576 

2 

2-5“ 

6-21“ 

22-85“ 

86-341“ 

342-1365“ 

1366-5 461“ 

5 462-21 845“ 

3 

- 

3-6'’ 

7-17“ 

18-41“ 

42-126“ 

127-288“ 

289-756“ 

4 

- 

- 

4-5 

6-11’' 

12-21“ 

22-43 

44-85 

5 

- 

- 

- 

5-6 

7-12-'' 

13-20 

21-27 

6 

- 

- 

- 

- 

6-7 

8-9 

10-17 

7 

- 

- 

- 

- 

- 

7-8 

9-10 

8 

- 

- 

- 

- 

- 

- 

8-9 


TABLE I: Table of the best known balanced-cycle orthogonal arrays (BOAs) for qubit (d = 2) systems, indicating the number 
of qubits that can be decoupled by a BOA scheme for the given locality and length. Shown are the locality ^ of the underlying 
Hamiltonian from 2 up to 8 and length = 4*’2fc of the BOA cycles from 64 up to 1048 576, corresponding to the values 
k = 2,... ,8 in Eq. 0 with g = 4 and e = 1. Each entry in the table denotes the range of the number n of qubits that 
can be achieved by a BOA scheme of the corresponding locality and length. For instance, the entry 7-17 at location (3, 2 048) 
indicates that in order to decouple a 3-local Hamiltonian on a system with n qubits, where n € {7,..., 17}, the best known BOA 
schemes have 2 048 time steps. If the number of qubits is one higher, e. g., n = 18, then the currently best known BOA scheme 
would require 10 240 time steps. Superscripts indicate if the dual codes [n, fc^,d ^]4 underlying the BOAs were obtained by a 
particular construction: a) all codes for 1 = 2 were obtained from the Hamming code family [n, n — k, 3)4 with k = log4(3n-|-1); 
b) the code [6, 3, 4)4 is the Hexacode |13| : c) the codes with parameters [17,13, 4 ) 4 , [41,36, 4 ) 4 , [126,120,4)4, [288,281,4)4, and 
[756, 748, 4)4 are based on caps in finite projective spaces which are sets of points of which no three are collinear, see |18| : d) the 
code [ 11 , 6 , 5)4 is a quadratic residue code, see |lll 5.13] and |13| : e) the code [21,15, 5)4 is the Kschischang-Pasupathy code, 
see [20| : and f) the code [12, 6 , 6)4 is a quadratic residue code, see [111 5.13] and [13] . All other codes in the table are based on 
the database of best known linear codes that is available in Magma m- 


£\N 

324 

4 374 

52 488 

590 490 

6 377 292 

66 961566 

2 

2-10“ 

11-91“ 

92-820“ 

821-7381“ 

7 382-66 430“ 

66431-597871“ 

3 

- 

3 - 10 '’ 

11-82'’ 

83-212'’ 

213-840'’ 

841-6 723'’ 

4 

- 

- 

4-10 

11-20 

21-72 

73-96 

5 

- 

- 

- 

5-10 

11-16 

17-73 

6 

- 

- 

- 

- 

6-10 

11-17 

7 

- 

- 

- 

- 

- 

7-10 


TABLE IL Table of the best known balanced-cycle orthogonal arrays (BOAs) for qutrit {d = 3) systems, indicating the number 
of qutrits that can be decoupled by a BOA scheme for the given locality and length. Shown are the locality £ of the underlying 
Hamiltonian from 2 up to 7 and length N = 9*2A: of the BOA cycles from 324 up to 66 961566, corresponding to the values 
A: = 2,..., 7 in Eq. 0 with q = 9 and e = 1. Each entry in the table denotes the range of the number n of qutrits that can 
be achieved by a BOA scheme of the corresponding locality and length. Superscripts indicate if the dual codes [n,A:^,5^]9 
underlying the BOAs were obtained by a particular construction: a) all codes for 1 = 2 were obtained from the Hamming code 
family [n, n — k, 3)9 with k = logg(8n -I- 1); and b) the codes with parameters [10, 7, 4 ) 9 , [82, 78, 4 ) 9 , [212, 207, 4 ) 9 , [840, 834, 4 ) 9 , 
and [6 723, 6 716, 4)9 are based on caps in finite projective spaces, see |18| . All other codes in the table are based on the database 
of best known linear codes that is available in Magma |19| . 


VII. TABLES OF BEST KNOWN BOA SCHEMES FOR SMALL SYSTEMS 

In the following, we present a summary of the best known BOA schemes for qubit (d = 2, g = 4) and qutrit 
(d = 3, g = 9) systems for a variety of small localities £ and system sizes n. All schemes are obtained by our 
main construction in Theorem where the underlying classical linear codes are either taken from the literature 
or from the Magma m database of best known linear codes which can be accessed using the Magma command 
BestDimensionLinearCode(<field>, <length>, <distance>). 

Recall from Remark l^that if we have a BOA decoupling scheme for n qudits, it can also be used for smaller systems 
of n' < n qudits. Therefore, the best known BOA for n qudits is also the best known BOA for all n' < n qudits unless 
a better BOA scheme for n' is known. Table |T] summarizes the best known schemes for systems of n qubits (d = 2), 
for small values of n, that can be obtained from good linear codes. Similarly, Table |TT] summarizes the best known 
schemes for systems of n qutrits (d = 3), for small values of n. 
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VIII. EXAMPLES 


Example 1 (2-local decoupling of a diagonal Hamiltonian). We first consider a simple case of decoupling a 
2-local Hamiltonian on 7 qubits, where we assume (to simplify the example) that the Hamiltonian is diagonal, i.e. 
consists only of Pauli Z operators. In this case, it turns out that we need not use an irreducible representation and 
can consequently use q = d = 2 (instead of g = = 4); to avoid clutter, we defer the proof that this works to 

Example 1^ where we will consider a similar situation. Because q = 2, we use the group F 2 = Z 2 = {0,1} and choose 
the representation 


p : Z 2 —>■ {1, A}, with p(0) = 1, p{l) = X 
and corresponding control unitaries 


uo(<5) = 1) ui{S) = e over time (5 G [0, |]. (5) 

Note that by evolving over time A = f, we can therefore implement (up to phase) Uo(A) = 1 = p(0) and Ui(A) = 
X = p(l). We assume that we can perform these control unitaries on any qubit. 

With our constraints of 7 qubits (n = 7) with 2-local interactions (£ = 2) and our ability to use q = 2, we seek an 
[n, k]g = [7, k ]2 code with dual distance J-*- = 3 = f -|- 1 for some (hopefully small) dimension k. We find that there is 
a [7, 3]2 code with this desired dual distance, given by the generator matrix 


G = 


1 0 0 
0 1 0 
1 1 0 
0 0 1 
1 0 1 
0 1 1 
1 1 1 


Observe that this code has dimension k = 3, which will dictate the efficiency of the protocol. 

Although irrelevant for our concerns here, one may observe that, as guaranteed by Theoreman array built from 
the codewords of this code is an orthogonal array; indeed, the OA shown in Fig. was constructed from this code. 
We, on the other hand, wish to create a BOA from this code. As per Theorem we start with the (additive) group 


¥g = Z 2 and choose the generating set = S' 2 ^'’ = 


:(3) 


Cayley graph P^Z^, 5*2^^^ shown in Fig. 


2(a) 


namely 


We set £ to be the Eulerian cycle on the 


= 






We map this cycle under the action of the generator matrix G to obtain the array Aqc = [Gm\meCf^ 3 j which is 
precisely the array that was shown in Fig. According to Theorem this is a BOA{N,n,q,£) = HOA(24, 7, 2, 2) 
with N = q^\S^'^\ = 2^ • 3 = 24. By definition, this means that every i x N subarray (obtained by only considering a 
selection of just i of the n rows) defines a balanced cycle on the Cayley graph r(^, of ^ = Z 2 with respect 

to some generating set (which may depend on the subarray). For example, look at rows 5 and 7, highlighted in 

Fig. 0 This defines the balanced cycle on T ^Z^, | | ^ that was shown in Fig. | 


2(b) 


The generating set and 

balanced cycle depend on the choice of rows, but by virtue of being a BOA, some balaiiced cycle will be obtained for 
any choice of 2 rows. 

According to Theorem the protocol of Protocol defined by this BOA performs bounded-strength decoupling 
on our 7-qubit 2-local system. To construct this protocol, we consider the transitions between the columns of the 
BOA, defining the schedule shown in Fig. The control unitaries to be applied are defined by these transitions and 
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1 
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It 


WfcisCi^) 0 0 WfcjjCiJ) = e ''^3'^e 

for time A = - over timeslot [—, —] 

2 L 9 ’ 2 -■ 


FIG. 6 : A 7 X 24 array defining the decoupling protocol in Example in the format of Fig. Rows correspond to qubit 
numbers, columns correspond to time slots (each of width A = ^), and entries correspond to unitary operators on qubits 

according to Eq. As per Protocol the control cycle evolution is Uc — 1) A + 5^ = u^_ {S)Uc (^{j - 1 ) A^ , ^ e [ 0 , A], where 

bj is the jith column. For example, because &5 = (0,1,1,0,0,1,1)^, we have (5) = uq (g) ui ( 8 ) ui < 8 ) Mo ® uo ® ui < 8 ) ui (5) = 

^-iX2S ^-iX^S ^-iXgS ^-iXjS 


our choice of Eq. which was chosen to be consistent with our representation p. For example, in time slot 5, the 
transition column is 65 = ( 0 , 1 , 1 , 0 , 0 , 1 , 1 )^, which corresponds to the unitary 




= g-i^2S^-iX3S^-iXe5^-iXTS 


where Xi denotes the Pauli X operator on the fth qubit. As per Protocol the control cycle evolution is 
C^c((j - 1)A + (5^ = u^_{S)Uc(^{j - l)Ay S G [0, A], where bj is the jth column in Fig. This protocol will de¬ 
couple any 2-local 7-qubit diagonal Hamiltonian. 

We epmhasize that in this simple diagonal-Hamiltonian example, we were able to use q = d = 2 (for reasons that 
will be addressed in Example . If the Hamiltonian were not known to be diagonal, this would not in general have 
been possible, and we would have needed to instead use a [7, k]^ code for g = 4. 


Example 2 (2-local decoupling using a Hamming code). Consider an arbitrary 2-local Hamiltonian H on a 
system of 5 qubits. Then H can be decoupled by applying a BOA derived from the code dual to a [5,3, 3(4 Hamming 
code, namely the [5, 2(4 code over F 4 with the generator matrix 


G = 


1 0 

0 1 

1 

a2 

1 


where a is a primitive element of order 3 of F 4 . Note that we arrange the code words as column vectors, consistent 
with the notation used throughout this paper and some - but not all - of the literature. Since here fc = 2, d = 2, and 
e = 1, the corresponding BOA has a total number of time steps given by A = d?^2ke = 64. When arranged into the 
columns of a 5 x 64 matrix, each of the 64 control Hamiltonians that are applied in this scheme corresponds to one 
of the 16 code words of the [5, 2)4 code. 

Example 3 (5-local decoupling of a diagonal Hamiltonian using a BCH code). Recall from Remark [^that 
if one is interested in decoupling a Hamiltonian of a particular form, it may not be necessary for p to be irreducible, 
and in such a case it may be possible to choose a code over a field F^ for which q is less than d^. Consider a diagonal 
(i.e., Z-only) 5-local Hamiltonian id on a system of 16 qubits. Then id can be decoupled by applying a BOA derived 
from the dual code of a BCH®’^*(F 2 /F 2 , 6 ) = [16, 7, 6 ] 2 , i.e. from a code over F 2 with parameters [16, 9]2 and generator 
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matrix 


G = 


100000000 
010000000 
001000000 
000100000 
000010000 
000001000 
000000100 
000000010 
000000001 
110 0 1110 0 
0 110 0 1110 
0 0 1 1 0 0 1 1 1 
110 10 1111 
101001011 
100111001 
1 0 0 0 1 0 1 1 1 


Due to the special structure of the Hamiltonian, we are able to choose q = d = 2 (rather than q = = 4) in this 

case. Since k = 9, d = 2, e = 1, and the Hamiltonian is Z only, the corresponding BOA has a total number of time 
steps given by = d^ke = 4608. When arranged into the columns of an 16 x 4 608 matrix, each of the 4 608 control 
Hamiltonians that are applied in this scheme corresponds to one of the 512 code words of the [16, 9]2 code. 

To construct our protocol from this code we first choose a generating set for F®, such as the k = 9 standard 
basis vectors {(1, 0, 0,.. , (0,1, 0,0,.. .)^,...}. We then find an Eulerian cycle C on the Cayley graph r(F|, 

and map it to an Eulerian cycle GC on the Cayley graph r(G[F 2 ], G[S' 2 ^^]) using the generator matrix above. Our 
choice of as being the standard basis vectors would dictate that the transition labels b = Gs, for s G are 
simply the columns of G. Our BOA consists of the 2® = 512 16-bit code words, each appearing exactly 9 times 
according to the order specified by GC. To use the BOA as a decoupling scheme, we may choose 


p : {0,1} ^ {1, X}, with p(0) = 1, p{l) = X 


and choose the corresponding single-qubit control unitaries to be 

uo{d) = 1, ui(<5) = over time S G [0, §]. 


Observe that (ignoring global phase) uo(§) = 1 = p(0) and Ui(^) = X = p(l). The multi-qubit control unitaries are 
defined by ug = Ub^ O • • • ® Ub^^ (for b G {0,1}^^). For example, if s = (1,0,0,.. .)^, then b = Gs is the first column of 
G and ug = ^- 1 X 146 ^-iXieS acting non-trivially on qubits 1, 10, 13, 14, 15, and 16. The 

control scheme in Protocol [^ is thus specified. 

We now prove that this example works, even though p is reducible (i.e. even though we are choosing q = 2 rather 
than q = 4). As per the argument in the proof of Theorem we need only focus on a single 5-local term of H (so 
assume without loss of generality that H consists of only one such term), we can ignore all but the 5 qubits on which 
it acts non-trivially, and we need only speak of the 5-qubit unitaries u^(5) that act on those qubits and correspond 
to G (where is the generator set for F^ derived from the BOA for those 5 qubits). According to the proof of 


Theorem 1 our scheme works if and only if Hg = 0. Here, however, p is not irreducible, so Hg will not 

suppress ml traceless operators; indeed, A-only operators commute with each Un and are therefore unmodified by Hg. 


that each term in is of the form 

u\Huj,. Now, H is diagonal, i.e. a tensor product of only 1 and Z, and Uj,{5) is a tensor product of only 1 and 
Therefore, because Ze~'^^^ = cos(2i5)Z-l-sin(2(5)y, we see that ulHujy can be expanded as a sum of tensor 
products of 1, Z and Y. Moreover, because H is traceless and conjugation by a unitary is trace-preserving, this sum 
cannot contain a term proportional to the identity, 1®^. Thus, each of these terms consists of at least one operator 
(which for notational purposes we take to be on the first qubit) that is a .Z or a E, i.e. each can be written in the form 
a 0 A, where a G {Y, Zj and A is some 4-fold tensor product of operators from {1,Y,Z}. Our protocol is defined by 


To show that Ug ^F^(H)'j = 0 nevertheless holds, observe from Eq. 
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a BOA of strength 5, so any subset of 5 rows of the BOA consists of all 2® 5-tuples in repeated an equal number of 
times. Thus the sum in Hg involves conjugating by each where ranges over all 2® possible tensor products that 
can be formed on 5 qubits using 1 and X. Focusing on the first qubit, we can equivalently say that the range over 

all possible 1 ®B and X ® B, where B ranges over {B 2 O i ?3 O i ?4 O i ?5 : i?i € {1, A}}. Conjugating ct O A by 1 Oi? 
yields either it O A or —cr O A, whereas conjugating instead hy X ^ B yields the same result but with the opposite 
sign (since a G {Y, Z}). In other words, (1 ®B){a O A)(II ®B) + {X Z) B){a ^ ^)(A1 Z) B) — 0. Thus, the sum in Bg 

cancels in pairs, i.e. Ilg(Fy{H)] is indeed 0. 


IX. CONCLUSION 

We have shown how to use bounded-strength controls to decouple n interacting qudits of dimension d = (for 
some prime p and positive integer e) that obey an £-local Hamiltonian. The system may be either closed or open (i.e. 
coupled to an environment), as long as both the system Hamiltonian and the environmental couplings are ^-local on 
the system. The decoupling scheme is described using a balanced-cycle orthogonal array, which we introduced and 
showed how to construct from classical linear codes. To determine the best possible scheme based on our method, we 
have to find the best linear error-correcting code C"*- = [n, k^]q of length n and distance at least £-1-1. By the best, we 
mean k-^ should be maximized for the given system size (n) and locality (£). The construction in the present paper 
yields a decoupling scheme that uses N = d?'^2ke time slices (of fixed length) where k = n — k^. 

Finding the best code is a key problem in the theory of error-correcting codes; extensive code tables have been 
compiled for small distances. For the important case of qubits with 2-local interactions, for example, one can use 
Hamming codes over F 4 such that k = log 4 ( 3 n -|- 1), whence N scales like nlogn. For higher degrees of locality, we 
can use families of BCH codes to construct the decoupling schemes. The designed distance of these codes is chosen 
based on the locality £ of the Hamiltonian, leading to a scaling of TV as logn. An open question is whether the 
schemes so derived are optimal in the asymptotic sense, i.e. whether, for fixed £ and qudit dimension d, a better 
scaling with n is possible. We note that it is known that when using bang-bang pulses, time at least H(n) is 
necessary to decouple general 2-body Hamiltonians, whereas our bounded-strength scheme takes time O(nlogn) using 
Hamming codes for such Hamiltonians. Another interesting open question is to develop a theory for systems with 
mixed qudit dimensions. All schemes derived here are decoupling schemes up to first order, and while it is easy to 
extend this to second order using symmetry, it would be interesting to find schemes that also achieve decoupling to 
higher orders. Finally, we mention as an avenue for future research the application of the derived bounded-strength 
decoupling schemes for the purpose of Hamiltonian simulation. 
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